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Abstract 

I review some recent work on applications of category theory to questions concerning theo¬ 
retical structure and theoretical equivalence of classical held theories, including Newtonian 
gravitation, general relativity, and Yang-Mills theories. 
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1. Introduction 


There are certain questions that arise in philosophy of science—including the philosophies 
of particular sciences—for which it would be useful to have a formal apparatus by which to 
represent a scientihc theory abstractlyO To this end, i n the context of a sustained critique of 


the so-called “semantic view” of theories. 


Halvorsonl (120121) has suggested that philosophers 


look to category theory for tools and inspiration. The proposal is that for some purposes, 
it is useful to think of a scientihc theory as a collection of (mathematical) models—though 


not, as in the semantic view, a bare set o: 
or more specihcally, a category of models 


models, but rather as a structured set of models. 


Email address: weatherj0uci.edu (James Owen Weatherall) 

^ Of c ourse, there is a lor ig history of pro posals aiming to do just this, beginning with CarnanI ( 1968 [1928] ), 


Russelll (2007 [19271), and Rarnsev (Il93lh . For overviews of this history and the current state of the field, 

see Lutz ( 20151) and HalvorsoT 2W5^7and references therein. _ 

^In some c ases, one might also consider topological structure ( Fletcheil . l2015 ). measure theoreti c structure 
( Curie]!, 20l4l . etc. The idea of using category theory to compare physical theories traces back to Mac Laii3 
( 1968h . who suggested that the Legendre transform between Hamilton and Lagrangian mechanics may be 


understood as a natural transformation—a re s ult con nected wi th recent work on the structu re of classical 
mechanics ( North! . [20091 : iCuriel l2013t iBarrett . 2014a 1. See also Lawvere and Schannel ( 1986!) . 
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Although the present chapter develops this idea, I do not defend ho Instead, my goal 
is to show how the idea can be put to work, by reviewing how it has been fruitfully applied 
to a cluster of related issues concerning symmetry, structure, and equivalence in the context 
of classical ('i.e. , non-quantu m') fi eld theor i es. Con temporary interest in these issues can be 


traced back to 


Stein! ( 19671 1 and 


EarmanI (jl989bl ). who showed how 17th century debates 


concerning substantivalism and relationism in Newtonian gravitation can (and should) be 

with certain geometrical 

structure]^ Meanwhile, 


Earman and Norton! f|l987!) and 


Stachel 


f|l989l) have argued that 


the infamous “hole argument” leads to puzzles about the structure of spacetime in general 
relativityO Recent debates concerning interpretations of Yang-Mills th eory, and so - called 


Healev. 

2001, 

2004, 

- 

0 

0 

Arntzenius. 

2 OI 2 I 


1998 . 


2003; 


I will begin by introducing the category theory that will appear in the sequel. Then I will 
show how this framework re-captures (old) intuitions about relative amounts of structure 
in classical spacetime theories. This discussion will lead to classical electromagnetism and 
a discussion of “gauge” structure. Finally, I will consider more difficult issues arising in 
general relativity and Yang-Mills theory. In these last two cases, I will argue, the tools 
developed in the earlier parts of the paper provide new traction on thorny interpretational 
issues of current interest 


more direct defense can be found in iHalvorsonl (|2015al this volume). Still, I take what follows to 
have some probative value: in my view, fruitful application of the sort described here is the most compelling 
reason to develop a formal program in philoso phy of science. 

'*This particular thread continues: recently, Saunders! ( 20131 1. KnoxI ( 201411 . and Weather^ ( 2015(J l have 
considered new arguments concerning just what structure is presupposed by Ne wton’s Princim a. 

®For up-to-date overviews of the state of the art on the hole argument, see IPoolevI (1201311 and [Norton 


( 1201111 . 

®These issues also connect up with more general concerns in philosophy of science and metaphysics, 
concerning (1) the role of symmetr y in guiding our attempts to extract metaphysical morals from our 
scientific (read: physical!) theories ( Ismael and van Fraassen . 2003t !Bakeil I 2 OI 0 I : Dasguota . 20151 and (2) 
the relationship between the structural features of our theories and various forms of realism ( WorrallL Il989t 
Ladvman and Ro5 l2009HFrench . 2014 1. 

^The material presented here draws on a number of recent papers by the author and collabora- 

























































































Before proceeding, let me flag a worry. Although some of the results I describe in the 
body of the paper are non-trivial, the category theory I use is elementary and, arguably, 
appears only superhcially. Meanwhile, there are areas of mathematics and mathematical 
physics closely related to the theories I discuss—synthetic differential geometry; higher gauge 
theory—where category theory plays a much deeper roleO But as I said above, my goal is 
to review some ways in which thinking of a physical theory as a category of models bears 
fruit for issues of antecedent philosophical interest. The role the category theory ends up 
playing is to regiment the discussion, providing the mathematical apparatus needed to make 
questions of theoretical structure and equivalence precise enough to settle. The fact that 
fairly simple ideas bear low-hanging fruit only provides further motivation for climbing into 
the higher branches. 


2. Structure and Equivalence in Category Theory 

There are many cases in which we want to say that one kind of mathematical object has 
more structure than another kind of mathematical objecty For instance, a topological space 
has more structure than a set. A Lie group has more structure than a smooth manifold. A 
ring has more structure than a group. And so on. In each of these cases, there is a sense 
in which the hrst sort of object—say, a topological space—results by taking an instance 
of the second sort—say, a set—and adding something more—in this case, a topology. In 
other cases, we want to say that two different kinds of mathematical object have the same 
amount of structure. For instance, given a Boolean algebra, I can construct a special kind 
of topological space, known as a Stone space, from which I can uniquely reconstruct the 


tors, including IW eatherall l (2015ali. JWeathei^ ( 2015e i. Weather^ ( 2015cb . Rosenstock et al. ( 2015 1. and 
Rosenstock and Weatherall (|2015alf^ 


^For more on synthetic differential g e ometr y, see KockI ( 2006l l: and fo r appl ications to physics, see 
Mac Lan^ ( 1968 ). Lawyere and Schannel ( 1986h and, more recently, iReyed ( 2011 ). Higher gauge theory 


is described by Baez and Schreibeir ( 2007 ). 

®The ideas in this section are deyeloped in more detail by iBarre t3 (I 2013 I 1 . 
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original Boolean algebra; and vice-versa. 


These sorts of relationships between mathematical objects are naturally captured in the 
language of category theory] via the notion of a forgetful functor. For instance, there is a 
functor F : Top —> Set from the category Top, whose objects are topological spaces and 
whose arrows are continuous maps, to the category Set, whose objects are sets and whose 
arrows are functions. This functor takes every topological space to its underlying set, and it 
takes every continuous function to its underlying function. We say this functor is forgetful 
because, intuitively speaking, it forgets something: namely the choice of topology on a given 
set. 


forgetful functor is made precise by a classihcation of functors due to 
. This requires some machinery. A functor F : C —D is said to be full 
if for every pair of objects A, B of C, the map F : hom(A, B) —)■ hom(F(A), F{B)) induced 
by F is surjective, where hom(A, B) is the collection of arrows from A to B. Likewise, F is 
faithful if this induced map is injective for every such pair of objects. Finally, a functor is 
essentially surjective if for every object X of D, there exists some object A of C such that 
F{A) is isomorphic to X. 

If a functor is full, faithful, and essentially surjective, we will say that it forgets nothing. 
A functor F : C —)■ D is full, faithful, and essentially surjective if and only if it is essentially 
invertible, i.e., there exists a functor G : D —)■ C such that G o F : C ^ C is naturally 
isomorphic to Ic, the identity functor on C, and F o G : D — D is naturally isomorphic to 
Id- (Note, then, that G is also essentially invertible, and thus G also forgets nothing.) This 
means that for each object A of C, there is an isomorphism rjA '■ G o F{A) —)• A such that 
for any arrow / : A —F in C, o G o F(/) = / o tja, and similarly for every object of D. 
When two categories are related by a functor that forgets nothing, we say the categories are 


The idea of a 


Baez et al. 


(I2nn4 


^*^1 will take for granted the basic definitio ns of category theory—that is, d efinitions of category and 
functor —but no more. For these notions, see Mac Laiig ( 1998 1. Leinster ( 2014h . and other papers in this 
volume. 
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equivalent and that the pair F, G realizes an equivalence of categories. 

Conversely, any fnnctor that fails to be fnll, faithfnl, and essentially snrjective forgets 
something. Bnt fnnctors can forget in different ways. A fnnctor F : C —)■ D forgets structure 
if it is not full; properties if it is not essentially surjective; and stuff' if it is not faithful. Of 
course, “structure”, “property”, and “stuff” are technical terms in this context. But they 
are intended to capture our intuitive ideas about what it means for one kind of object to 
have more structure (resp., properties, stuff) than another. We can see this by considering 
some examples. 

For instance, the functor F ; Top —)■ Set described above is faithful and essentially 
surjective, but not full, because not every function is continuous. So this functor forgets 
only structure—which is just the verdict we expected. Likewise, there is a functor G : 
AbGrp Grp from the category AbGrp whose objects are Abelian groups and whose 
arrows are group homomorphisms to the categry Grp whose objects are (arbitrary) groups 
and whose arrows are group homomorphisms. This functor acts as the identity on the 
objects and arrows of AbGrp. It is full and faithful, but not essentially surjective because 
not every group is Abelian. So this functor forgets only properties: namely, the property of 
being Abelian. Finally, consider the unique functor H : Set 1, where 1 is the category 
with one object and one arrow. This functor is full and essentially surjective, but it is not 
faithful, so it forgets only stuff—namely all of the elements of the sets, since we may think of 
1 as the category whose only object is the empty set, which has exactly one automorphism. 

In what follows, we will say that one sort of object has more structure (resp. properties, 
stuff) than another if there is a functor from the first category to the second that forgets 
structure (resp. properties, stuff). It is important to note, however, that comparisons of this 
sort must be relativized to a choice of functor. In many cases, there is an obvious functor to 
choose—i.e., a functor that naturally captures the standard of comparison in question. But 
there may be other ways of comparing mathematical objects that yield different verdicts. 
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For instance, there is a natnral sense in which gronps have more strncture than sets, since 
any group may be thought of as a set of elements with some additional structure. This 
relationship is captured by a forgetful functor F : Grp —Set that takes groups to their 
underlying sets and group homomorphisms to their underlying functions. But any set also 
uniquely determines a group, known as the free group generated by that set; likewise, func¬ 
tions generate group homomorphisms between free groups. This relationship is captured by 
a different functor, G : Set —)■ Grp, that takes every set to the free group generated by it 
and every function to the corresponding group homomorphism. This functor forgets both 
structure (the generating set) and properties (the property of being a free group). So there 
is a sense in which sets may be construed to have more structure than groups. 


3. Classical Spacetime Structure 


To get a feel for how to apply these ideas to issues in philosophy of physics, I will begin 
by trans l ating s ome well-understood examples in to the present terms. In particular, John 


EarmanI f 1989bl. Ch. 2), building on 


Stein! (119671) and others, characterizes several classical 


spacetime structures that have been of interest historically. To focus on just two of the 
most important (the present discussion is easily extende^, Galilean spacetime consists in 
a quadruple V), where M is the manifold ta is a one form on M; is a 

smooth, symmetric tensor held of signature (0,1,1,1), and V is a hat covariant derivative 
operator. We require that ta and be compatible in the sense that talF^ = 0 at every 
point, and that V be compatible with both tensor helds, in the sense that Va4 = 0 and 
= 0 . 

The points of M represent events in space and time. The held ta is a “temporal metric”, 


^^More generally, the manifolds I consider throughout the chapter are smooth, Hausdorff, paracompact, 
and connected. All fields defined on these manifolds are likewise assumed to be smooth, unl ess I indicate 
otherwise. I work in the abstract index not ation, as d e velop ed bv jPenrose and Rindlei (1984). For further 
details on classical spacetime structure, see iMalamend (l2012l Ch. 4). 
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assigning a “temporal length” |ta^“| to vectors at a point p G M. Since is simply 
connected, Vah = 0 implies that there exists a smooth function t : M —)■ M such that 
ta = Vah We may thus dehne a foliation of M into constant-t hypersurfaces representing 
collections of simultaneous events—i.e., space at a time. We assume that each of these 
surfaces is diffeomorphic to and that restricted these surfaces is (the inverse of) a 
flat, Euclidean, and complete metric. In this sense, may be thought of as a spatial metric, 
assigning lengths to spacelike vectors, all of which are tangent to some spatial hypersurface. 
We represent particles propagating through space over time by smooth curves whose tangent 
vector called the 4-velocity of the particle, satishes ^°‘ta = 1 along the curve. The 
derivative operator V then provides a standard of acceleration for particles, which is given 
by Thus, in Galilean spacetime we have notions of objective duration between 

events; objective spatial distance between simultaneous events; and objective acceleration 
of particles moving through space over time. 

However, Galilean spacetime does not support an objective notion of the (spatial) ve¬ 
locity of a particle. To get this, we move to Newtonian spacetime, which is a quintuple 
{M,ta,h °'^The first four elements are precisely as in Galilean spacetime, with the 
same assumptions. The hnal element, is a smooth vector held satisfying rj°'ta = 1 and 
= 0. This held represents a state of absolute rest at every point—i.e., it represents 
“absolute space”. This held allows one to dehne absolute velocity: given a particle passing 
through a point p with 4-velocity the (absolute, spatial) velocity of the particle at p is 

There is a natural sense in which Newtonian spacetime has strictly more structure than 
Galilean spacetime: after all, it consists of Galilean spacetime plus an additional element. As 
Earman observes, this judgment may be made precise by observing that the automorphisms 
of Newtonian spacetime—that is, its spacetime symmetries—form a proper subgroup of 
the automorphisms of Galilean spacetime. The intuition here is that if a structure has more 
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symmetries, then there must be less structure that is preserved by the mapso In the case of 
Newtonian spacetime, these automorphisms are diffeomorphisms : M —)■ M that preserve 
ta, h°‘^, V, and 77 “. These will consist in rigid spatial rotations, spatial translations, and 
temporal translations (and combinations of these). Automorphisms of Galilean spacetime, 
meanwhile, will be diffeomorphisms that preserve only the metrics and derivative operator. 
These include all of the automorphisms of Newtonian spacetime just described, plus Galilean 
boosts. 

It is this notion of “more structure” that is captured by the forgetful functor approach 
described above. To recapitulate Barman, we dehne two categories. Gal and New, which 
have Galilean and Newtonian spacetime as their (essentially unique) objects, respectively, 
and have automorphisms of these spacetimes as their arrowso Then there is a functor 
F : New —)■ Gal that takes arrows of New to arrows of Gal generated by the same 
automorphism of M. This functor is clearly essentially surjective and faithful, but it is not 
full for reasons already discussed, and so it forgets only structure. Thus the criterion of 
structural comparison given above perfectly recapitulates Barman’s condition—and indeed, 
may be seen as a generalization of the latter to cases where one is comparing collections of 
models of a theory, rather than individual spacetimes. 

To see this last point more clearly, let us move to another well-trodden example. There 
are two approaches to classical gravitational theory: (ordinary) Newtonian gravitation (NG) 
and geometrized Newtonian gravitation (GNG), sometimes known as Newton-Gartan the¬ 
ory. Models of NG consist of Galilean spacetime as described above, plus a scalar held (p, 
representing a gravitational potential. This held is required to satisfy Poisson’s equation. 


^^This way of comparing structure is also explored by Barrett ( 2014lJ ). under the name SYM*. He goes 
on to argue that Minkowski spacetime (see fn. mi) also has less structure than Newtonian spacetime, but 
that Galilean and Minkows ki spacetime s have incomparable amounts of structure. 

^^For simplicity, following iBaez et al. ( 2004h . I am restricting attention to groupoids, i.e., categories with 
only isomorphisms. There are several ways in which one could add arrows to the categories I discuss here 
and in what follows, but nothing turns on how or whether one includes them. 




















= 47rp, where p is a smooth scalar field representing the mass density on spacetime. 
In the presence of a gravitational potential, massive test point particles will accelerate ac¬ 
cording to —V“(p, where is the 4-velocity of the particle. We write models as 

The models of GNG, meanwhile, may be written as quadruples (M,G,h“^,V), where 
we assume for simplicity that M, and are all as described above, and where V 
is a covariant derivative operator compatible with ta and /i“^. Now, however, we allow 
V to be curved, with Ricci curvature satisfying the geometrized Poisson equation. Rat = 
Airptatb, again for some smooth scalar field p representing the mass density] ^^1 In this theory, 
gravitation is not conceived as a force: even in the presence of matter, massive test point 
particles traverse geodesics of V—where now these geodesics depend on the distribution of 
matter, via the geometrized Poisson equation. 

There is a sense in which NG and GNG are empirically equivalent: a pair of results due 
to iTrantmanI fjl965l ) guarantee that (1) given a model of NG, there always exists a model 


of GNG with the same mass distribution and the same allowed trajectorie s for massive tes 


point particles, and (2), with some further assumptions, vice versa (see 

Malament. 

2012 

§4.2). But in a number of influential articles. 

Glvmour 

(1970, 

1977, 

1980 

has argued that 


these are nonetheless inequivalent theories, because of an asymmetry in the relationship just 
described. Given a model of NG, there is a unique corresponding model of GNG. But given 
a model of GNG, there are typically many corresponding models of NG. Thus, it appears 
that NG makes distinctions that GNG does not make (despite the empirical equivalence), 
which in turn suggests that NG has more structure than GNG. 


^^Here and throughout, we suppress the source terms in differential equations when writing models of 
a theory, since such fields can generally be defined from the other fields—so, for instance, given a model 
ofNG, we define an associated mass density by Poisson’s equation. 

^®The Ricci tensor associated with a covariant derivative operator V is defined by Rbc = R°'bca, where 
R°‘bcd, is the Riemann tensor, which is the unique tensor field such that given any point p and any smooth 
vector field defined on a neighborhood of that point, R°‘bcd^^ = The Riemann tensor vanishes 

iff the derivative operator is flat. 
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This intuition, too, may be captured using a forget functor. Define a category NG whose 
objects are models of NG (for various mass densities) and whose arrows are automorphisms 
of M that preserve ta, V, and (p; and a category GNG whose objects are models of 
GNG and whose arrows are automorphisms of M that preserve G, and V. Then there 
is a functor F : NG ^ GNG that takes each model of NG to the corresponding model 


given by the Trautman results, a nd takes eac 


diffeomorphismO Then results in 


Weatherall 


1 arrow to an arrow generated by the same 


2015al) imply the following. 


Proposition 1. F : NG —)■ GNG forgets only structure. 


4. Excess Structure and “Gauge” 

The relationship between NG and GNG captured by Prop. [T]is revealing. As noted, there 
is a sense in which NG and GNG are empirically equivalent—and more, there is a sense in 
which the theories are capable of representing precisely the same physical situations. And 
yet, NG has more structure than GNG. This suggests that NG has more structure than 
is strictly necessary to represent these situations, since after all, GNG can do the same 
representational work with less structure. Theories with this character are sometimes said 
to have “excess structure” and are often called gauge theories by physicists^ 

The archetypal example of a gauge theory is classical electromagnetism. To be clear 
about the sense in which electromagnetism has excess structure, I will present two charac¬ 
terizations of its models. (For simplicity, I limit attention to electromagnetism in a hxed 
background of Minkowski spacetime, (M, 77ab))|j£| On the hrst formulation, which I will call 
EMi, the fundamental dynamical quantity is a two form Fab, known as the Faraday tensor. 
This tensor field represents the electromagnetic held at each point O It is required to satisfy 


^®That this is a functor is established in I Weath erall ( ^ISall. 

^’^For more on the term “gauge theory”, see IWeatheralll (l2015ell . 

Minkowski spacetime is a relativistic spacetime consisting of the man ifold wi t h a f lat, complete 
Lorentzian metric gab- For more on relativistic spacetimes, see section |6] and Malament J 201^ . 

^®For more on how to recover electric and magnetic fields from Fab, see lMalamentI (j2012[ S2.61. 
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Maxwell’s equations, which may be written as daFbc = 0 and VaF°'^ = where d is the 
exterior derivative, V is the Minkowski spacetime derivative operator, and is the charge 
current density associated with any charged matter present in spacetime. On the second 
formulation, EM 2 , the fundamental dynamical quantity is a one form, Aa, known as the 
vector potential. The vector potential is required to satisfy On 

the hrst formulation, a model of electromagnetism may be represented as an ordered triple, 
(M, rjab, Fab)] on the second, a model would be a triple (M, rjab, Ag). 

As with NG and GNG above, there is a close relationship between EMi and EM 2 . 
Given a model of EM 2 , {M,r]ab, Ag), I can always construct a model of EMi that satishes 
Maxwell’s equations for the same charge-current density, by dehning Fgb = dgAb. Gonversely, 
given a model {M,rigb, Fgb) of EMi, there always exists a suitable vector potential Ag such 
that Fgb = dgAb- On both formulations, the empirical signihcance of the theory is taken 
to be captured by the associated Faraday tensor—directly for models of EMi, and by the 
relationship Ag 1 —)■ dgAb for EM 2 . And so, we have a clear sense in which the two formulations 
are empirically equivalent. 

These formulations of electromagnetism are analogous to NG and GNG, as follows. While 
every model of EM 2 gives rise to a unique model of EMi, there are typically many models 
of EM 2 corresponding to any given model of EMi. Once again, this appears to have the 
consequence that EM 2 makes distinctions that EMi does not—and that these distinctions 
between models of EM 2 add nothing to the empirical success of the theory. Again, this 
relationship is captured by a forgetful functor: dehne a category EMi with models of EMi 
as objects and isometries of Minkowski spacetime that preserve Fgb as arrows; and dehne a 
category EM 2 with models of EM 2 as objects, and isometries of Minkowski spacetime that 
preserve Ag as arrows. We may then dehne a functor F : EM 2 —)■ EMi that acts on objects 
via An 1-4 — d gAb, and which takes arrows to arrows generated by the same isometry. 


Weatheralll (j2ni5el ) then proves the following. 


11 





Proposition 2. F : EM 2 —)■ EMi forgets only structure. 


We thus have a sense in which EM 2 has more strnctnre than EMi—and indeed, since EMi 
and EM 2 are taken to have the same representational capacities, it would seem that EM 2 
must have excess structure. This raises a question, thongh. If EM 2 has excess strnctnre, 
why do physicists use it? The answer is pnrely pragmatic: vector potentials are often more 
convenient to work with than Faraday tensors. Moreover, the excess strnctnre does not 
cause any problems for the theory, because physicists are well aware that it is there, and it 
is easily controlled. In particular, if Aa is a vector potential, then A'^ = Aa + Va'0, for any 
smooth scalar field fj, will be snch that daAa = daA'f^] moreover, A'^^ will satisfy the relevant 
differential eqnation for a given charge-current density jnst in case Aa does. The map that 
takes to yl'“ is an example of a gauge transformation. Vector potentials related by gange 
transformations are taken to be physically eqnivalent—even thongh they are mathematically 
inequivalent, in the sense that they concern distinct helds on Minkowski spacetime. 

Once we take gange transformations into acconnt, the two formnlations are nsnally taken 
to be eqnivalent ways of presenting electromagnetism. This relationship can also be captnred 
in the langnage we have been developing. In effect, that the fnnctor F : EM 2 —)■ EMi 
dehned above fails to be fnll reflects the fact that, on the natnral way of relating models of the 
theories, there are arrows “missing” from EM 2 . The gange transformations, meanwhile, may 
be nnderstood as additional arrows—that is, they provide a second notion of isomorphism 
between models of EM 2 that preserves the strnctnre that we take to have representational 
signihcance in physics. These arrows can be added to EM 2 to dehne a new category, EM 2 , 
which has the same objects as EM 2 , but whose arrows / : {M,riab, Aa) {M,r]ab, ^'a) are 
now pairs / = (y, Ga), where y : M —)■ M is an isometry of Minkowski spacetime, Ga = Va'^ 
for some smooth scalar field ip, and x*{^'a) = + Ga. We may then dehne a new fnnctor 

F' : EM '2 ^ EMi, which has the same action on objects as F, bnt which acts on arrows as 
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(x, Ga) XO IWeatherad (j2ni5al ) then proves the following 


Proposition 3. F' : EM 2 —)• EMi forgets nothing. 


As 


Weatherall fj2015al) also shows, one can identify an analogous class of gauge transfor¬ 


mations between models of NG. These additional transformations, which reflect the fact that 
Newtonian physics cannot distinguish a state of inertial motion from one of uniform linear 
acceleration, have arguably been recognized as equivalences between models of Newtonian 
gravitation since Newton’s Principia —indeed, one m^interpret Corollary VI to the Laws of 
Motion as describing precisely these transformations]^^! More, one can dehne an alternative 
category with models of NG as objects, and with these Newtonian gauge transformations 
included among the arrows; this new category is then equivalent to GNG. 


5. Yang-Mills Theory and Excess Structure 

The discussion of gauge theories in the previous section makes precise one sense in which 
electromagnetism and Newtonian gravitation have excess structure: there are textbook for¬ 
mulations of these theories that have structure that apparently plays no role in how the 
theories are used, in the sense that there are other formulations of the theories that have 
the same representational capacities, but which have less structure in the sense described by 
Props. [H and [21 In many cases of interest, however, we do not have multiple formulations 
to work with, and we are confronted with the question of whether a particular formulation 
has excess structure without a comparison class. 

It is here that the relationship between EM 2 and EM 2 becomes especially important. 
What we see in Prop. [3] is that the excess structure in EM 2 , as captured by Prop. [21 
may be “removed” by identifying an additional class of “gauge transformations” that relate 


^°That EM 2 is a category, and that F' is a functor, is show n in Weather^ (2015a i. 

^^For more on t he significance of Corollary VI, see DiSallel ( 2008 ). Saunderg ~( 2013 ). Knox ( 2014 ). and 


Weatherall! (|2015d) . 
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non-isomorphic, but physically equivalent, models of the theories. This observation provides 
us with an alternative criterion for identifying when a formulation of a theory has excess 
structure: namely, when there are models of the theory that we believe have precisely the 
same representational capacities, but which are not isomorphic to one anotherWith this 
criterion in mind, I will now turn to other theories that are often called gauge theories, to 
try to identify whether they have excess structure. In this section, I will consider (classical) 
Yang-Mills theory; in the next section, general relativity. 

Classically, a model of Yang-Mills theory consists in a principal connection on a 
principal bundle G ^ P ^ M with structure group G, over a relativistic spacetime (M, gab) 
(see section [6]) o We will write these models as (P, gab, The principal bundle P may be 
thought of as a bundle of frames for associated vector bundles V P XqV ^ M, sections 
of which represent distributions of matter on space-time. The principal connection 
determines a unique derivative operator on every vector bundle; this derivative operator is 
the one appearing in matter dynamics, and in this way the connection affects the evolution 
of matter. Conversely, every matter held is associated with a horizontal and equivariant 
Lie algebra valued one form J^a on P, called the charge-current density. The principal 
connection is related to the total charge-current density on P by the Yang-Mills equation, 

U) U) 

where is the exterior covariant derivative relative to * is a Hodge 
star operator on horizontal and equivariant forms on P determined by the spacetime metric 
gab on M, and is the curvature associated with u^a- Thus, Yang-Mills theory may be 
understood as a theory in which charged matter propagates in a curved space, the curvature 


^^This criterion is also discussed in Weatherall ( 2015el) . 

^^One also requires an inner product on the Lie algebra associated to G, but that will play no role i n 
the fol l owing . For more on t he principal bundle forma lism for Yang-M i lls theo ry, see Trautmai] ( IQSOll . 
Palais! ( 1981 1. Bleecker ( 1981 1. Deligne and FreedI ( 1999l l. and Weather^ ( 2ni5b l. The no tation used here, 


a variant of the abstract index notation, is explained in an appendix to Weatheralll ( 2015ljl . Briefly, fraktur 
indices indicate valuation in a Lie algebra; lower case Greek indices label tangent vectors to the total space 
of a principal bundle; and lower case Latin indices label tangent vectors to spacetime. So, for instance, uj^a 
is a Lie algebra-valued one form on P, mapping tangent vectors at a point of P to the Lie algebra associated 
to G. 
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of which is dynamically related to the distribution of charged matter. 


The connection, curvature, and charge-current density may all be represented as fields 
on M by choosing a local section a : U ^ M ^ P and considering the pullbacks of these 
fields along the section. The resulting fields are generally dependent on the choice of section; 
changes of section are known as “gauge transformations”. Note, however, that these gauge 
transformations are strongly disanalogous to the ones encountered in section 01 they are 
not maps between distinct models of the theory, and so they do not indicate that there 
are non-isomorphic models of the theory that have the same representational capacities. 
Instead, they are most naturally construed as changes of local coordinate system—or, frame 
field—relative to which one represents invariant fields on P as fields on 

In fact, it seems that Yang-Mills theory as just described does not have excess structure 
at all. One way of emphasizing this point is to observe that classical electromagnetism is a 
Yang-Mills theory in the present sense, which means we can compare EMi and EM 2 with 
a third formulation of the theory, EM 3 . EM 3 is a Yang-Mills theory with structure group 
U{1). The Lie algebra associated with this group is M, and so Lie algebra valued forms are 
just real valued, which means we can drop the index 21 from the curvature and connection 
forms. Continuing as above, we limit attention to Minkowski spacetime. Thus a model of 
the theory is a triple (P, 77 ^ 6 , Wq,), where cUq is a principal connection on P(l) —)■ P A M, 
the unique (necessarily trivial) P(l) bundle over Minkowski spacetime. Given such a model, 
we may generate models of EMi or EM 2 by choosing a (global) section cr : M —)■ P, and 
defining a Faraday tensor Fab = cr*{flai 3 ) or a vector potential Aa = a*(LJa), respectively. 
Since P(l) is Abelian, Fab is independent of the choice of section; Aa, however, depends on 
the section, with different sections producing different (gauge related) vector potentials. If 


^^One could recover a sense in which changes of section were gauge transformations in the other sense, 
discussed above, by stipulating that models come equipped with a preferred choice of section. But this is not 
natural, for multiple reasons—the most important of which is that in general, there are no sections defined 
on all of M. 
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oJa satisfies the Yang-Mills equation for a charge-current density Ja, then Aa and Fab will 
satisfy their respective equations of motion for a held Ja = cr*{Ja)- 

We now dehne a category EM3 whose objects are models of EM3 and whose arrows 
are principal bundle isomorphisms that preserve both Ua and the Minkowski metric rjab- 
There is also a natural functor F : EM 3 — )■ EMi, which acts on objects as (P , r]ah,0Jn.) *—t 


(M, riah , dna*(urv)) (for any global section a) and on arrows as (tI/,'0) t 'ip. 


Weatherall 


fj2015el j then proves the following result. 


Proposition 4. F : EM 3 —)■ EMi forgets nothing. 

It follows that, by the criterion of comparison we have been considering, the principal 
bundle formalism has the same amount of structure as EMi, and less structure than EM 2 . 
This is despite the fact that the dynamical variable of the theory is the connection on 
P, which is analogous—via a choice of section—to the vector potential. The reason the 
equivalence holds is that given any diffeomorphism that preserves the Faraday tensor, there 
is a unique corresponding principal bundle automorphism that also preserves Ua, in effect 
by systematically relating the possible sections of P. 

These variants of electromagnetism on Minkowski spacetime are ultimately toy examples. 
That said, there is another issue in the neighborhood that has been a locus of recent debate. 
It concerns the relationship between the formalism for Yang-Mills theory just described—the 
so-called “hber bundle formalism”—and a formalism known as the “holonomy formalism” or 
“loop formalism”. Each of these is often associated with an “interpretation” of Yang-Mills 
theory, though I will not discuss those interpretationso It is sufficient to note that one 
theme in these debates is the claim that the holonomy formalism posits (or requires) less 
structure than the hber bundle formalism. 


particular, see Healed fe 00 tIi andlArntze niu.s ( 2012) for di scussions of the interpretive options related 
to these formalisms; see also iRosenstock and Weatheralll (l2015bf l. 
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The idea behind the holonomy formalism for Yang-Mills theory is as followso Given a 
hber bundle model of a Yang-Mills theory, {P, gab,oj'^a), and a hxed point u E P, we may 
dehne a map H : —>■ G, where is the collection of piecewise smooth closed curves 

7 : [0,1] — )• M originating (and ending) at p{u). This map assigns to each curve 7 G 
the element g E G such that 7n(l) =^u(0)g', where 7 ^ : [0,1] —>■ F is the (unique) horizontal 
lift of 7 through u, relative to The group element is known as the holonomy 

of 7 . One usually interprets 7^(1) G F to be the parallel transport of u along 7 , relative 
to the principal connection o;^; thus, the holonomies associated with a connection encode 
information about the parallel transport properties of u^a- 

It is arguably the case that the holonomies of a principal connection contain all of the 
empirically signihcant data associated with a hber bundle model of the theoryo The holon¬ 
omy formalism attempts to characterize Yang-Mills theory directly with holonomy data. 
It does so via a generalized holonomy map H : ^ G, which simply assigns group el¬ 


ements to closed curves without ever mentioning a principal bundle. (No t any m a p wi^ 


do; the properties required of a 


and 


genera 


ized holonomy map are described in 


BarrettI ( 19911 1 


Rosenstock and Weatheralll (j2ni5al ).) Many commentators have had the intuition that 


this approach is more parsimonious than the hber bundle formalism, since it posits just the 
structure needed to encode the possible predictions of the theory, without any geometrical 
superstructure. 

But can this thesis of relative parsimony be made precise? It is not clear that it can be. In 
fact, the methods described here give a strikingly diherent answer. For any given Yang-Mills 
theory, circumscribed by some hxed choice of structure group G, we may dehne two categories 


^^Fo r det ails, see iRusen stock an d Weatheralll (2015a); for further background, see Barrett ( 1991 1. Lolll 
(1994), and lCambini and PullinI ( 1996ll . 

^"^The horizontal lift of a curve 7 : [0,1] —)• M through u £ p~^ [7(0)] is the unique c urve "I'v. : 10,1] — t F such 
that 7T o 7„ = 7 and = 0 along the curve, where is the tangent to 7„. See iKobavashi and Nomizu 

(Il96,l Ch. 2 ). 

^^See iHealevI (2007) for a defense of this claim. It is usually motivated by arguing that the empirical 
significance of a Yang-Mills theory concerns only interference effects exhibited by quantum particles. 
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of models, associated with each of these formalisms. The first, PC, is a generalization of 
EM 3 : the objects are hber bundle models {P, gabjOj'^a), with structure group G; and the 
arrows are principal bundle isomorphisms that preserve the connection and metric. The 
second. Hoi, corresponds to the holonomy formalism. Here the objects are holonomy models, 
(M, gab, H : Lx ^ G), where H is a generalized holonomy map, and the arrows are holonomy 
isomorphisms, which are maps that preserve th e metric and holonomy struc t ure. (T hese are 


somewhat subtle, and are described in detail in 


Rosenstock and Weatheralll fl2015al) .l There 


is a functor F : Hoi —PC that takes every holonomy model {M,gab, H) to a fiber bundle 
model that gives rise to the holonom ies H, with appropriately compatible action on arrows. 


Rosenstock and Weatheralll (j2015al) then prove the following result. 


Proposition 5. F ; Hoi —?■ PC forgets nothing. 


In other words—philosophers’ intuitions notwithstanding—the formalisms underlying 
holonomy and fiber bundle interpretations have precisely the same amounts of structure, 
relative to a natural standard of comparison between models of the theories]^ 


6 . General Relativity, Einstein Algebras, and the Hole Argument 

Finally, I will return to spacetime physics, to discuss the methods above in the context of 


general relativity. I will briefly discuss two rela ted t opics. 


‘hole argument” of 


Earman and Norton! (119871 ) and 


Stachel 


le first concerns the so-called 


19891 ): the second concerns a 


proposa l for a n alternative to the standa rd forma l ism of 


Gerochl (119721) . and later championed by 


EarmanI (Il977al]bl . 


ge neral 


1979, 


relativity originally due to 


1986 


1989a 


M)B 


Space constraints prevent further elaboration on this point; see Rosenstock and Weatherall ( 2015lJ) for 
further details. 

^°For more on the hole argument, see Earman ( 1989b ) and PoolevI ( 201,l[) : see also Weather^ ( 2015d[) . 
which extends the arguments given here. Stache]l ~( 20l4) also provides a detailed discussion of the hole 
argument in the context of category theory, though the perspective is somewhat different from that adopted 
here. 
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To begin, the models of relativity theory are relativistic spacetimes, which are pairs 
(M, Qab) consisting of a 4-manifold M and a smooth, Lorentz-signature metric (/abO The 
metric represents geometrical facts about spacetime, such as the spatiotemporal distance 
along a curve, the volume of regions of spacetime, and the angles between vectors at a point. 
It also characterizes the motion of matter: the metric gab determines a unique torsion-free 
derivative operator V, which provides the standard of constancy in the equations of motion 
for matter. Meanwhile, geodesics of this derivative operator whose tangent vectors satisfy 
> 0 are the possible trajectories for free massive test particles, in the absence of 
external forces. The distribution of matter in space and time determines the geometry of 
spacetime via Einstein’s equation, Rab — \Rgab = 87rTab, where Tab is the energy-momentum 
tensor associated with any matter present, Rab is the Ricci tensor, and R = R°^a- Thus, 
as in Yang-Mills theory, matter propagates through a curved space, the curvature of which 
depends on the distribution of matter in spacetime. 

The most widely discussed topic in the philosophy of general relativity over the last thirty 
years has been the hole argument, which goes as follows. Fix some spacetime {M,gab), and 
consider some open set O C M with compact closure. For convenience, assume Tab = 0 ev¬ 
erywhere. Now pick some diffeomorphism ijj : M ^ M such that acts as the identity, 

but is not the identity. This is sufficient to guarantee that 'i/' is a non-trivial automor¬ 
phism of M. In general, 'i/' will not be an isometry, but one can always dehne a new spacetime 
(M, i^*{gab)) that is guaranteed to be isometric to (M, gab), with the isometry realized by '0. 
This yields two relativistic spacetimes, both representing possible physical conhgurations, 
that agree on the value of the metric at every point outside of O, but in general disagree 
at points within O. This means that the metric outside of O, inclu ding at all points in 


the past of O, cannot determine the metric at a point p E O. Thus, 


Earman and Norton 


■^^My notational conventions again follow IMalamend (j2ni2h . In particular, I work with a signature 
(1, —1, —1, —1) metric. 
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(119871) argue, general relativity, as standardly presented, faces a pernicious form of inde¬ 


terminism. To avoid this indeterminism, one must become a relationist and accept that 
“Leibniz equivalent”, i.e., isometric, spacetimes represent the same physical situations. The 
person who denies this latter view—and thus faces the indeterminism—is dubbed a manifold 
substantivalist. 

One way of understanding the dialectical context of the hole argument is as a dispute 
concerning the correct notion of equivalence between relativistic spacetimes. The manifold 
substantivalist claims that isometric spacetimes are not equivalent, whereas the relationist 
claims that they are. In the present context, these views correspond to different choices 
of arrows for the categories of models of general relativity. The relationist would say that 
general relativity should be associated with the category GRi, whose objects are relativistic 
spacetimes and whose arrows are isometries. The manifold substantivalist, meanwhile, would 
claim that the right category is GR 2 , whose objects are again relativistic spacetimes, but 
which has only identity arrows. Clearly there is a functor F : GR 2 —)■ GRi that acts 
as the identity on both objects and arrows and forgets only structure. Thus the manifold 
substantivalist posits more structure than the relationist. 

Manifold substantivalism might seem puzzling—after all, we have said that a relativistic 
spacetime is a Lorentzian manifold (M, gab), and the theory of pseudo-Riemannian manifolds 
provides a perfectly good standard of equivalence for Lorentzian manifolds qua mathematical 
objects: namely, isometry. Indeed, while one may stipulate that the objects of GR 2 are 
relativistic spacetimes, the arrows of the category do not reflect that choice. One way of 
charitably interpreting the manifold substantivalist is to say that in order to provide an 
adequate representation of all the physical facts, one actually needs more than a Lorentzian 
manifold. This extra structure might be something like a hxed collection of labels for 
the points of the manifold, encoding which point in physical spacetime is represented by 
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a given point in the manifoldo Isomorphisms would then need to preserve these labels, 
so spacetimes would have no non-trivial automorphisms. On this view, one might use 
Lorentzian manifolds, without the extra labels, for various purposes, but when one does so, 
one does not represent all of the facts one might (sometimes) care about. 

In the context of the hole argument, isometries are sometimes described as the “gauge 
transformations” of relativity theory; they are then taken as evidence that general relativity 
has excess structure. But as I argued in section [5l one can expect to have excess structure 
in a formalism only if there are models of the theory that have the same representational 
capacities, but which are not isomorphic as mathematical objects. If we take models of 
GR to be Lorentzian manifolds, then that criterion is not met: isometries are precisely the 
isomorphisms of these mathematical objects, and so general relativity does not have excess 
structure. 

This point may be made in another way. Motivated in part by the idea that the standard 
formalism has excess structure, Earman has proposed moving to the alternative formalism 
of so-called Einstein algebras for general relativity, arguing that Einstein algebras have 
less structure than relativistic spacetimeso In what follows, a smooth n—algebra A is an 
algebra isomorphic (as algebras) to the algebra C°°{M) of smooth real-valued functions on 
some smooth n—manifold. Mo A derivation on A is an M-linear map ^ : A ^ A satisfying 
the Leibniz rule, ^{ab) = a^{b) + b^{a). The space of derivations on A forms an A-module, 
r(A), elements of which are analogous to smooth vector fields on M. Likewise, one may 
define a dual module, r*(A), of linear functionals on r(A). A metric, then, is a module 
isomorphism g : r(A) r*(A) that is symmetric in the sense that for any ^,rj & r(A), 
9(0 iv) = 9{v){0- With some further work, one can capture a notion of signature of such 


similar sugg estion is made bv IStachell (|1993ll . 

^^For deta i ls, see Rosenstock et al.r ( 201!T[ l. In many ways, the arguments here are reminiscent of those of 
Rvnasiewicz ( 1992tl . 

These may also be characterized in purely algebraic terms ( Nestrued . l200.'l[ l. 
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metrics, exactly analogously to metrics on a manifold. An Einstein algebra, then, is a pair 
{A,g), where A is a smooth 4—algebra and is a Lorentz signature metric. 


Einstein algebras arguably provide a “relationist” formalism for general relativity, since 
one specihes a model by characterizing (algebraic) relations between possible states of mat¬ 
ter, represented by scalar helds. It turns out that one may then reconstruct a unique 
relativistic spacetime, up to isometry, from these relations by representing an Einstein al¬ 
gebra as the algebra of functions on a smooth manifold. The question, though, is whether 
this formalism really eliminates structure. Let GRi be as above, and dehne EA to be 
the category whose objects are Einstein algebras and whose arr ows are algebra homomo r- 


phisms that preserve the metric g (in a way made precise by 


Rosenstock et ah 


(1201.511 1 . 


Dehne a contravariant functor F : GRi —> EA that takes relativistic spacetimes (M, gab) 
to Einstein algebras {C°°{M),g), where g is determined by the action of gab on smooth 
vector helds on M, and takes isometries t)) : (M, Pab) ^ (M^ qRl to alge b ra iso morphisms 


-0 : C°°{M') —)■ dehned by ijj{a) = a o -0^2 

following. 


Rosenstock et ah 


(1201.51 ) prove the 


Proposition 6. F : GRi —)■ EA forgets nothing. 


7. Conclusion 

I have reviewed several cases in which representing a scientihc theory as a category of 
models is useful for understanding the structure associated with a theory. In the context of 
classical spacetime structure, the category theoretic machinery merely recovers relationships 
that have long been appreciated by philosophers of physics; these cases are perhaps best 
understood as litmus tests for the notion of “structure” described here. In the other cases, 
the new machinery appears to do useful work. It helps crystalize the sense in which EM 2 

contravariant functor is one that takes arrows f : A ^ B to arrows F{f) : F{B) F{A). The 
classification described above carries over to contravariant functors, though two categories related by a full, 
faithful, essentially surjective contravariant functor are said to be dual, rather than equivalent. 

22 











and NG have excess structure, in a way that clarihes an important distinction between 
these theories and other kinds of gauge theories, such as Yang-Mills theory and general 
relativity. It also clarihes the relationship between various formulations of physical theories 
that have been of interest to philosophers because of their alleged parsimony. These results 
seem to rehect real progress in our understanding of these theories—progress that apparently 
required the basic category theory used here. One hopes that these methods may be extended 
further—perhaps to issues concerning the relationships between algebras of observables and 
their representations in quantum held theory and the status of dualities in string theoryjf^ 
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